An Enhanced Adaptive Scoring Job Scheduling Algorithm with Replication Strategy in Grid Environment
Abstract
Grid computing is a form of distributed computing that involves coordinating and sharing data storage and network resources. The goal of grid job scheduling is to achieve high system throughput and to match each job to an appropriate available computing resource. The complexity of the scheduling problem increases with the heterogeneous nature of the grid, which makes effective scheduling highly difficult. Existing algorithms do not adapt to the dynamic grid environment. To utilize the power of the grid fully and to assign jobs to resources dynamically, an efficient algorithm called Adaptive Scoring Job Scheduling (ASJS) was introduced. However, the bandwidth and storage capacity occupied by data-intensive and computation-intensive jobs are high, and each time the user has to specify whether a job is computation-intensive or data-intensive. As a result, jobs are not completed in time. To solve this problem, the Enhanced Adaptive Scoring Job Scheduling algorithm is introduced. Each job is identified as either data-intensive or computation-intensive and is scheduled accordingly. Jobs are allocated by computing a Cluster Score (CS). The jobs submitted by the user are divided into subtasks and replicated. With this strategy, a job occupies less storage capacity and bandwidth. Because the grid environment is dynamic, the status of the resources changes continually; each time it does, the Cluster Score (CS) is recomputed and the jobs are replicated and allocated to the most appropriate resources.
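The abstract does not give the Cluster Score formula, the rule for classifying jobs, or the replication policy, so the following Python sketch is only an illustration under stated assumptions. It assumes that CS is a weighted sum of a cluster's normalized transmission power and computing power, that the weights shift toward transmission for data-intensive jobs, that a job is classified by its ratio of input data to workload, and that every subtask is replicated to the top-scoring clusters. All names, weights, and thresholds (`Cluster`, `Job`, `alpha_data`, `threshold`, and so on) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Cluster:
    name: str
    transmission_power: float  # assumed: normalized to [0, 1]
    computing_power: float     # assumed: normalized to [0, 1]


@dataclass
class Job:
    name: str
    input_size_mb: float  # amount of data the job must transfer
    workload_mi: float    # computation, in million instructions (must be > 0)


def is_data_intensive(job: Job, threshold: float = 1.0) -> bool:
    """Assumed rule: a high data-to-computation ratio means data-intensive."""
    return job.input_size_mb / job.workload_mi > threshold


def cluster_score(cluster: Cluster, data_intensive: bool,
                  alpha_data: float = 0.7, alpha_compute: float = 0.3) -> float:
    """Assumed CS: alpha * transmission + (1 - alpha) * computing."""
    alpha = alpha_data if data_intensive else alpha_compute
    return alpha * cluster.transmission_power + (1 - alpha) * cluster.computing_power


def schedule(job: Job, clusters: list[Cluster],
             n_subtasks: int = 2, replicas: int = 2) -> dict[str, list[str]]:
    """Split the job into subtasks and place each on the best-scoring clusters."""
    data_intensive = is_data_intensive(job)
    ranked = sorted(clusters,
                    key=lambda c: cluster_score(c, data_intensive),
                    reverse=True)
    targets = [c.name for c in ranked[:replicas]]
    # Assumed policy: every subtask is replicated to the same top-ranked clusters.
    return {f"{job.name}-part{i}": list(targets) for i in range(n_subtasks)}
```

Because the resource status changes over time, a scheduler along these lines would call `schedule` again with refreshed `Cluster` values whenever a new job arrives, so that the scores reflect the current state of the grid.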
Similar resources
Hierarchical Replication Strategy for Adaptive Scoring Job Scheduling in Grid Computing
Grid technology, which brings together a number of personal computer clusters with high-speed networks, can reach the same computing power as a supercomputer at a minimum cost. However, a grid is a heterogeneous system, and scheduling independent tasks on it is more difficult. In order to utilize the power of the grid completely, we need an efficient job scheduling algorithm to execute ...
An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity
Data grid technology, which uses the scale of the Internet to overcome storage limitations for huge amounts of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environments to copy frequently accessed data to suitable sites. The primary purposes are shortening the distance of file transmission and achieving files from ...
A New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability
A Data Grid is an infrastructure that controls huge amounts of data files and provides intensive computational resources across geographically distributed collaborations. The heterogeneity and geographic dispersion of grid resources and applications pose some complex problems, such as job scheduling. Most existing scheduling algorithms in Grids only focus on one kind of Grid jobs which can be data...
Data Replication-Based Scheduling in Cloud Computing Environment
High-performance computing and vast storage are two key factors required for executing data-intensive applications. In comparison with traditional distributed systems like data grid, cloud computing provides these factors in a more affordable, scalable and elastic platform. Furthermore, accessing data files is critical for performing such applications. Sometimes accessing data becomes...
Dynamic Replication based on Firefly Algorithm in Data Grid
In data grids, reservation is an accepted way to provide scheduling and service quality. Users need access to the stored data in a geographical environment, which can be provided by using replication, an action taken to reach certainty. As a result, users are directed toward the nearest version to access information. The most important point is to know in which sites and distributed sy...